OPEN 3 ACCESS Freely available online 



•0-PLOS I o-^E 



Discordance of Tuberculin Skin Test and Interferon 
Gamma Release Assay in Recently Exposed Household 
Contacts of Pulmonary TB Cases in Brazil 

Rodrigo Ribeiro-Rodrigues\ Soyeon Kim^, Flavia Dias Coelho da Silva\ Aleksandra Uzelac^, 
Lauren Collins^, Moi'ses Palaci^, David Alland^, Reynaldo Dietze\ Jerrold J. Ellner^, Edward Jones- 
Lopez^*, Padmini Salgame^* 

1 Cellular and Molecular Immunology Laboratory, Nucleo de Doen^as Infecciosas, UFES, Vitoria, Brazil, 2 Department of Preventive Medicine and Community Health, New 
Jersey Medical School, Rutgers Biomedical and Health Sciences, Newark, New Jersey, United States of America, 3 Division of Infectious Diseases, Department of Medicine, 
New Jersey Medical School, Rutgers Biomedical and Health Sciences, Newark, New Jersey, United States of America, 4 Boston University School of Medicine, Boston, 
Massachusetts, United States of America, SMycobacteriology Laboratory, Nucleo de Doen^as Infecciosas (NDI), Universidade Federal do Espi'rito Santo (UFES), Vitoria, 
Brazil, 6 Section of Infectious Diseases, Department of Medicine, Boston Medical Center and Boston University School of Medicine, Boston, Massachusetts, United States of 
America 



(D 

CrossMark 



Abstract 

Interferon-gamma (IFN-y) release assays (IGRAs) such as the Quantiferon Gold In-tube test are in vitro assays that measure 
IFN-y release from T cells in response to M. tuberculosis {IVltb)-speciflc antigens. Unlike the tuberculin skin test (TST), IGRA Is 
specific and able to distinguish IVltb-infection from BCG vaccination. In this study we evaluated the concordance between 
TST and IGRA and the efficacy of IGRA in diagnosing new IVltb infection in household contacts (HHC) of pulmonary 
tuberculosis (PTB) cases. A total of 357 HHC of TB cases in Vitoria, Brazil were studied. A TST was performed within 2 weeks 
following enrollment of the HHC and if negative a second TST was performed at 8-12 weeks. HHC were categorized as 
initially TST positive (TST+), persistently TST negative (TST-), or TST converters (TSTc), the latter representative of new 
infection. IGRA was performed at 8-12 weeks following enrollment and the test results were positive in 82% of TST+, 48% of 
TSTc, and 12% of TST-, indicating poor concordance between the two test results among HHC in each category. Evaluating 
CXCL10 levels in a subset of IGRA supernatants or lowering the IGRA cutoff value to define a positive test increased 
agreement between TST and IGRA test results. However, ROC curves demonstrated that this resulted in a trade-off between 
sensitivity and specificity of IGRA with respect to TST. Together, the findings suggest that until the basis for the discordance 
between TST and IGRA is fully understood, it may be necessary to utilize both tests to diagnose new Mtb infection in 
recently exposed HHC. Operationally, in IGRA negative HHC, it may be useful to employ a lower cutoff value for IGRA to 
allow closer monitoring for potential conversion. 
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Introduction 

Tuberculosis (TB), caused by Mycobacterium tuberculosis (Mtb), is 
still a major human health problem, especially in developing 
countries. The World Health Organization estimated that there 
were 8.8 million new cases of TB (12% co-infected with HIV) and 
1.34 million people died from TB in 2012 [1]. In most individuals 
exposed to Mtb, the infection is contained by the host immune 
response and remains latent. In latent tuberculosis infection 
(LTBI), the mycobacteria persist and can reactivate at a later time 
to cause active disease in about 10% of those infected over a 
lifetime (HIV uninfected). It has been estimated that one-third of 
the world's population has LTBI, serving as a vast reservoir from 
which new cases can emerge; hampering TB eradication. 
Therefore, a major challenge in TB control is to be able to 
diagnose, predict, and treat those with LTBI before they develop 



active disease. The diagnosis of Mtb infection is complicated by 
the lack of a practical gold standard test. In order to assess the 
predictive utility of diagnostic tests for LTBI, long-term follow-up 
of individuals from infection to disease is required. 

Until recently, the tuberculin skin test (TST) and chest X-ray 
examination have been the main diagnostic tools used for TB 
contact investigations. However, chest X-ray examination is only 
done if LTBI is present in an individual. For close to a century, 
TST was the only option for the diagnosis of LTBI. TST is based 
on in vivo cell-mediated immune response to Mtb antigens after 
intra-dermal injection of tuberculin purified protein derivative 
(PPD), a complex mixture of proteins [2] derived from a human 
strain of Mtb. A positive TST result indicates that an individual is 
infected with Mtb, but does not differentiate between latent and 
active disease [3-5] . TST values of ^ 1 0 mm are interpreted as 
positive in most immunocompetent individuals [6]. However, 
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false-positive TST may occur after infection witii environmental 
nontuberculous mycobacteria that share common antigens with 
Mtb or after BaciUe Calmette-Guerin (BCG) vaccination [6-8]. 
False-negative TST may occur in the presence of HIV infection 
and other immunosuppressive states or after recent vaccination 
with hve-attenuat(xi \ irai vaccines [9]. It is common knowledge 
that errors introduced by misplacing and/or misreading the TST 
may adversely affect accuracy [10]. False positive TST are also 
problematic in BCG \ accinated infants, if tested at age ^10 years. 
[11]. Therefore, the utility of TST in the identification of LTBI 
has be(;n challenging in child contacts in countries where BCG 
vaccination is widely administered in infancy, such as in Brazil. 
False positive TST remain problematic through adulthood in 
individuals vaccinated after the first year of life [11]. 

Access to in vitro methods measuring specific cellular immune 
respon.ses to Mtb offers potential advantages and may circumvent 
some limitations presented by TST. Identification of the highly 
specific RD 1 region in Mtb genome has enabled the development 
of new diagnostic tests for Mtb infection. Efforts have concentrated 
on methods for detecting IFN-y, a key Thl cell-associated cytokine 
necessary for effective host defense against Mtb (reviewed in [12]). 
Interferon-gamma (IFN-y) release assays (IGRAs) detect IFN-y 
release from lymphocytes after in vitro incubation of whole blood 
with Mtb antigens, such as early secretory antigenic target-6 
(ESAT-6) and culture filtrate protein 10 kD (CFP-10). IGRAs offer 
enhanced specificity by measuring IFN-y production in response 
to antigens that are specific to Mtb and, therefore, not influenced 
by cross-reactivity to A4. bovis BCG and most non-tuberculous 
mycobacteria (NTM) [13]. The commercially available ELISA- 
based QuantiFERON TB Gold IN-TUBE, which includes the TB 
7.7 (p4) as an additional antigen, is an example of such an in vitro 
IGRA test. 

HHC of pulmonar\' TB patients are at highest risk of Mtb 
infection and progression to TB disease [14]. Identification of 
infected HHC is ther(;f()rc' critical for TB control. Although 
IGRAs are more specific than TST, serial studies have shown 
frequent reversions and conversions [15]. In a recent study this 
temporal change in IGRA was attributed to the variability 
inherent to the test rather than driven by host and bacterial 
factors [16]. Only a small number of studies have performed both 
TST and IGRA to determine the prevalence of LTBI among close 
contacts of index TB cases [17-21] and several of these studies 
reported discordancy between IGRA and TST [19-21]. Indeed, 
findings from one study indicate that in endemic countries, TST 
results may be more informative than IGRA to initiate LTBI 
treatment [19]. Therefore, a side-by-side comparison of TST and 
IGRA in recentiy exposed HHC is essential in determining the 
utihty of IGRA in accurately identifying latent infection in a high- 
risk population, such as Brazil [22], where routine contact tracing 
programs are in place. The aim of this study was to determine the 
performance of IGRA compared to TST in recently exposed 
HHC as part of a larger International Collaljoration in Infectious Diseases 
Research (ICIDR) study designed to investigate the extent of Mtb 
transmission in HHC exposed to an index case (IC) with infectious 
pulmonary TB (PTB). 

Subjects and Methods 

Study population 

The ICIDR cohort began enroUment on March 13, 2008. Adult 
culture confirmed, AFB smear a 2-1- TB patients were enrolled as 
index cases; within two weeks of enrollment, their HHC were 
evaluated for Mtb infection and disease. Only HHC of index cases 
who fulfilled the following inclusion criteria were enrolled: 1) age 



S18 years with cough a 3 weeks; 2) new TB episode with Si 
sputum specimen that was AFB >2+ with subsequent growth of 
M. tuberculosis in culture, and; 3) living with >3 persons fulfilling 
the study definition of a contact. We excluded HIV-infected 
patients (or those who refused HIV testing), patients with a 
previous history of TB treatment, and patients who were too ill to 
consent, unable to understand, or comply with, the study protocol. 
A household contact was defined as an individual of any age 
fulfilling at least one of the following criteria of close contact with 
the index TB cases for S3 months before enrollment: 1) person 
sleeping under the same roof S5 days/week; 2) person sharing 
meals S5 days/week; 3) person watching TV nights or weekends; 
and, 4) other significant contact. Primary or secondary prophylaxis 
with isoniazid was offered to participants meeting guid<Jines of the 
Brazilian TB Control Program [23]. We obtained written 
informed consent and assent in Portuguese in accordance with 
age-specific ethical guidelines from participating institutions. 

Only those index cases and their HHC enrolled by March 19, 
2010 were included in this substudy. The following parameters 
were evaluated in HHC: a) presence of TB disease symptoms; b) 
TST result; and c) chest X-ray. Eight weeks later, HHC had: a) 
repeat TST (if <10 mm); b) IGRA testing using Quanti- 
FERON TB-Gold In-Tube; and c) evaluation of the household 
environment. Blood samples for IGRA were drawn at the second 
visit and obtained prior to application of a second TST, in those 
initially TST negative. 

TST 

The initial TST involved an intra-dermal injection of 0.1 mL (2 
tuberculin units) of the PPD RT23 (Statens Serum Institute, 
Copenhagen, Denmark) into the anterior surface of the forearm 
with a standard tuberculin syringe. Reactions were evaluated 72 to 
96 hours after the injection by two expert readers who are certified 
and have a concordance rate S95% with the local TST reference 
reader. A positive result on the TST was defined as an induration 
of > 1 0 mm in the transverse diameter using the ball pen method. 
If the first TST result was < 1 0 mm, then a second TST was 
performed 8—12 weeks following enrollment (Figure 1). TST 
conversion between 1'' and 2"'' TST readings was defined as those 
with initial TST <10 mm and 2°*^ TST >10mm. We also 
utilized a second definition of TST conversion (Menzies Criterion) 
that additionally required the PPD induration to increase by at 
least 6 mm. 

IGRA using QuantiFERON TB-Gold In-Tube 

Blood samples for IGRA testing were collected by venipuncture 
prior to intradermal injection of PPD using vials provided by the 
manufacturer. The blood samples were transported to the Niicleo 
de Doengas Infecciosas (NDI-UFES), at the Federal University of 
Espirito Santo, in Vitoria, Espirito Santo state, Brazil, and used for 
whole-blood IFN-y release assay (QuantiFERON TB Gold IN- 
TUBE, Qiagen, Valencia, CA, USA) according to the manufac- 
turer's instructions. For the whole-blood stimulation step, 1 ml 
aliquots of whole blood were incubated as follows: a) nil control 
(saline); b) MTB antigens (early secretory antigenic target 6 
[ESAT-6], culture filtrate protein 10 [CFP-10], and TB7.7 (p4)); 
and c) positive control (mitogen). Following stimulation, IFN-y 
levels in whole blood supernatants were quantified by ELISA. A 
positive result was defined as [(MTB antigen-stimulated IFN-y 
level) minus (nil-stimulated IFN-y level)] >0.35 lU/ml. This will 
be referred to as (TB Ag-Nil), hereafter. TST and IFN-y assay 
readings were performed independently, without knowledge of the 
outcome of the other. 
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Figure 1 . Pie charts show the number and percentage of HHC in each category. IGRA responses were analyzed in the three HHC subgroups 
(TST-, TST+ and TSTc) and the breakdown is diagrammatically represented. 
doi:1 0.1 371 /journal.pone.0096564.g001 



CXCL10 quantitation in IGRA supernatants 

CXCLIO ELISA on the IGRA supernatants was performed 
using the Meso Scale Discovery (MSD) platform (Meso Scale 
Discovery, Rockville, MD, USA). 

Data analysis 

A generalized estimating equation (GEE) approach was used to 
estimate confidence intervals for the percentage at or above IGRA 
cutoffs among TST-positive (TST+) and among TST converters 
(TSTc), and percentage below the cutoff among TST-negative 
(TST-), adjusted for clustering within a household. When defining 
a positive test using a combination of IGRA or CXCLIO values, 
the percentages were estimated using the inverse probability of 
being included in the CXCLIO sample as weights in GEE models. 
Non-parametric ROC curves were estimated and plotted using the 
pROC (http:/ /cran.r-project.org/) (pROC: an open-source pack- 
age for R and S+ to analyze and compare ROC curves [24] . For 
all other analyses SAS 9.2 software (SAS Institute Inc., Gary, NC, 
USA) was used. 

Ethics Statement 

The study was approved by Ethics on Human Research 
Commitee-Health Sciences Campus - Universidade Federal do 
Espirito Santo, Conselho National de Etica em Pesquisa (Brazilian 
National Council On Research Ethics), Boston University Medical 
Campus Institutional Review Board and RBHS Institutional 
Review Board. We obtained written informed consent and assent 
in Portuguese in accordance with age-specific ethical guidelines 
from participating institutions. Written parental consent was 
obtained for children <7 years and written assent and written 
parental consent were obtained for children between 7-17 years. 



This consent procedure was approved by IRBs of all participating 
institutions. 

Results 

Demographic characteristics of index cases (IC) and 
household contacts (HHC) 

Of the 467 HHC enroUed in the ICIDR study by the cutoff 
date, 110 participants could not be included in the analysis 
because: a) one had active TB disease; b) four had no TST and 
IGRA testing; c) nine had no TST testing; d) one had a week 8-12 
TST but no initial TST; e) twenty-tiiree were TST negative at 
week 1-2 but had no repeat TST at week 8-12; f) seventy-one had 
no IGRA testing; and g) one had an indeterminate IGRA result. 
Therefore, a total of 357 HHC distributed over 76 households 
were included in this study. For the IC, the median age was 33 
years (range: 18 to 81 years), 39% were female and the median 
body mass index (BMI) was 18.0 kg/ml For HHC, the median 
age was 20 years (range: 6 months to 87 years), 59.4% were 
female, and a BCG scar was reported in 77.3%. The observed 
median BMI among HHC aged 20+ years was 23.2 kg/m^ 

TST and IGRA Results 

Considering a TST induration ^10 mm as TST positive, 195 
(55%) out of the 357 HHC were positive at the T' TST [(TST+), 
Figure lA]. After a second TST test performed 8—12 weeks later 
on those negative at the first reading, 48 (13%) HHC became 
newly TST positive [(TSTc), Figure IB] and a total of 243 out of 
the 357 HHC were TST positive increasing the frequency from 
55% to 68% (Figure IB). The remaining 114 (32%) HHC were 
TST negative at both at T' and 2"*^ testing [(TST-), Figure IB]. 
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IGRA responses were analyzed in the three HHC subgroups 
(TST-, TST+ and TSTc) and the breakdown shown in Table I 
and diagrammatically represented in Figure 1 is as follows: i) Of 
the 1 14 TST- HHC, 100 (88%) were IGRA negative (TST-IGRA- 
); and 14 (12%) were IGRA positive [(TST-IGRA+) Figure IC]; ii) 
Of the 195 TST+ HHC, 159 (82%) were also IGRA positive 
(TST+IGRA+) and the remaining 36 (18%) were IGRA negative 
[(TST+IGRA-) Figure ID]; iii) Of the 48 TSTc HHC, 23 (48%) 
were also IGRA positive (TSTcIGRA+) and the remaining 25 
(52%) were IGRA- [(TSTcIGRA-) Figure IE]. Taken together, a 
total of 75 (21'/o) HHC showed discordance between TST and 
IGRA [Kappa = 0.57 (95% confidence interval (CI): 0.48, 0.65)], 
with 61 TST-HGRA- and 14 TST-IGRA-I-, a ratio of 4.4 (95% 
confidence interval (CI): 2.4, 7.8), indicating that, when tests were 
discordant, TST was more likely to be the test that was positive 
(p<0.001). 

Although there is no established gold standard for LTBI, our 
study treated TST as a surrogate reference standard for the 
purpose of elucidating the operational characteristics of IGRA 
with respect to TST. Our study design allowed us to compare TST 
converters to those TST positive at first evaluation. IGRA only 
identified 52% of converters (Figure IE), Whereas those HHC 
with TST positive may be a heterogeneous group, potentially 
including individuals with longstanding Mtb infection (unrelated 
to the index case exposure) and/ or BCG cross-reactive individuals 
in addition to recent infection (unobserved TST converters), 
IGRA identified 82% of these individuals (Figure ID), a 
significantiy greater percentage compared to TST converters 
(p<0.001). 

Analysis of the presence of BCG scars showed that the 
proportion of HHC with a scar was similar in the 6 sub-groups 
indicating that it was unlikely that the discordance was influenced 
by BCG vaccination (Table 1). In order to exclude the possibility 
that enhanced TST positivity could be related to BCG cross- 
reactivity, we compared TST and IGRA in only those HHC 
whose TST was negative (TST- and TSTc). Of tiie 162 HHC 
whose TST was negative (includes TST- and TSTc), 39 (24%) 
were discordant for TST and IGRA, (Kappa = 0.38 (95% CI: 
0.22, 0.54), with tiie ratio of TST converters/IGRA negative 
(n = 25) to TST negative (second test also)/IGRA positive (n = 14) 
being 1.8 (95% CI: 0.9, 3.4). This indicates a reduced but 
marginally significant (p = 0.078) greater estimated proportion of 
Mtb infection as defined by TST compared to IGRA even when 
restricting our sample to a population where the possibility of BCG 
cross-reactivity was minimized. Interestingly, with regards to 
exposure to the index case (measured as average hrs/ day exposure 
to the index case over a 3 month period), the discordant groups 
(TSTc/IGRA- and TST+/IGRA-) had a less intense exposure 
compared to the concordant positive groups (TSTc/IGRA-l- and 
TST-I-/IGRA-I-) (trend test p<0.001) (Table 1). 

Comparison of the magnitude of the TST response by PPD 
diameter in the TSTc at week 8-12 was similar to that of the 
TST+ group at entry; a median of 1 5 mm in TSTc compared to 
16 mm in TST-I- (p = 0.15) (Figure 2A). However, comparison of 
the magnitude of the IGRA responses in the three groups showed 
that in the TSTc, the majority of the responses clustered around 
the cut-oflF value with a median TB Ag - Nil of 0.22 lU/ml 
(Figure 2B). Furthermore, the median TB Ag - Nil value in the 
TSTc group, though statistically dilferent (p<0.001), was only 
0.21 lU/ml higher than that of the TST- group and below the 
0.35 lU/ml positivity level recommended by the manufacturer 
(Figure 2B). In contrast, the TSTc had a median TB Ag - Nil 
value (2.46 lU/ml) lower than that of the TST+ group, a 
substantial difference which was also statistically significant 



(p = 0.003). Together, these data indicate that there is a high level 
of IGRA discordance in the TSTc group and that the majority of 
the discordancy is due to IGRA results that are close to the cutoff 
value. 

Analysis of CXCL10 levels in IGRA supernatants in a 
subset of HHC 

CXCLIO, an IFN-y inducible chemokine involved in Thl cell 
migration to inflammatory sites [25] has been studied as an 
alternative marker for discriminating between LTBI and active TB 
[26-29] . Next, we determined whether discordance between TST 
and IGRA would be reduced by additionally determining 
CXCLIO levels in IGRA supernatants. Supernatants from 150 
participants were sampled, with those that w(-re discordant IGRA 
negative, i.e., TST-positive/IGRA-iiegative or TST converters/ 
IGRA-negative, oversampled. As shown in Figure SI, CXCLIO 
levels from both unstimulated and stimulated IGRA supernatants 
were significantly increased in subjects that were IGRA positive 
when compared to their negative counterparts. Since the levels of 
CXCLIO in the TSTc/IGRA- and TST+/IGRA- groups were 
similar to that in TST-/IGRA- group, the addition of CXCLIO 
measurement did not improve IGRA concordance with TST. 

Improving IGRA and TST concordance by changing IGRA 
cutoffs 

Receiver operating characteristic (ROC) curves were generated 
to show the trade-off of sensitivity and specificity by varying the 
quantitative IGRA cutoff defining a positive result, i.e., the TB Ag 

- Nil values when "true positive" is defined as HHC whose TST 
converted between T' to 2"'' tests (TSTc) and "true negative" is 
defined as HHC whose baseline and follow-up TST were both 
negative (TST-). A second curve was superimposed which uses the 
same "true negative" definition but defined "true positive" as 
HHC with an initial TST positive result (TST+). The ROC 
highlights estimates corresponding to the standard cutoff value (TB 
Ag - Nil = 0.35 lU/ ml) and two more liberal cutoff values (TB Ag 

- Nil =0.10 and 0.15 lU/ml) chosen by inspection post-hoc to 
afford favorable trade-offs between sensitivity and specificity. As 
shown in Figure 3, specificity and sensitivity obtained using the 
standard cutoff value (TB Ag - Nfl = 0.35 lU/ml) were 87.7% 
and 47.9"'o, respectively. When a cutoff value of 0.15 lU/ml was 
considered, sensitivity increased to 56.3%, and when the cutoff 
was 0.10 lU/ml sensitivity further increased to 62.5%. Although 
sensitivity was improved by the use of alternate cutoff values, 
specificity diminished considerably to 83.3% and 81.6%, for 
cutoffs of 0.15 and 0.10 lU/ml, respectively (Figure 3, Table 2). 
For virtually the entire range of cutoff values for TB Ag - Nil, the 
ROC curve for TST-h HHC lays above that for TSTc, indicating 
better sensitivity for any given specificity (Figure 3). This result 
indicates that IGRA is less useful for distinguishing TST negative 
individuals from TST converters as compared to TST negative 
from prevalent TST positive, regardless of what cutoff value was 
considered. 

Improving IGRA and TST concordance by using Menzies 
criterion for TST conversion 

To determine whether TST conversion was due to boosting, the 
data were also analyzed using the Menzies criterion which is 
designed to distinguish between true conversion and boosting [30] 
in the analysis. Analysis of the data by excluding TSTc who had a 
difference of <6 mm between the l*"' and 2""^ TST tests (not 
meeting the Menzies criterion) showed that the AUG for when all 
converters were included in the analysis versus converters using 
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Figure 2. Comparison of the magnitude of TST and IGRA responses. A) PPD Diameter for 1" and 2 TST in TST-negative (TST-), TST- 
converter (TSTc), and Prevalent TST positive (TST+) groups displayed by PPD testing visit and TST group*. B) IGRA results using Quantiferon Gold In- 
tube quantitative TB Ag - Nil by TST-, TSTc, and TST+ groups. 2"*^ TST is only performed when PPD diameter in TST1 is <10 mm. Dashed line in 
Figure 2A at 10 mm is the cutoff for a positive TST result. ^ indicates values greater than 10 lU/ml (prior to December 2009 levels >10 lU/ml were 
reported as 10+). A dashed line in Figure 2B is shown at 0.35 lU/ml, the cutoff for a positive IGRA test result. 
doi:1 0.1 371 /journal.pone.0096564.g002 



tlie Menzies criterion was similar overall (Figure 3). However, the 
curves diverged near the standard cutoff (TB Ag - Nil = 0.35 lU/ 
ml) and the sensitivity increased from 47.9% to 56. 1 % when using 
the Menzies criterion for TST conversion (Table 2). Small 
increases in sensitivity were seen when the two alternate cutoff 
values for IGRA were used. This result suggests that IGRA 



sensitivity increases when the TST result is corrected for possible 
boosting effects but is still less useful at distinguishing TST 
negative individuals from TST converters and than from TST+. 
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o 




Percentage<C among TST- 

Figure 3. ROC curves for TBag — Nil. Statistics displayed on the 
curves are C = X: Y, Z, where X is the TB Ag - Nil (lU/ml) cutoff value for 
which IGRA would be declared positive. Y is Percentage of TST-negative 
(Specificity) that is below the cutoff and Z are the Percentage of TST- 
positive or TST converters at or above the cutoff (Sensitivity). 
doi:1 0.1 371 /journal.pone.0096564.g003 

Improving IGRA and TST concordance by inclusion of 
CXCL10 values 

In addition to evaluating alternate IGRA cutofT values to 
improve IGRA and TST concordance, several combinations of 
IGRA and CXCLIO values were analyzed, using the CXCLIO 
stimulated/ unstimulated ratio or just the stimulated value alone 
(Table 2). As depicted in Table 2, inclusion of CXCLIO levels to 
determine test positivity showed that there was an improvement in 
concordance in both the TST+ and TSTc groups; however, there 
was also an increase in positivity in the TST- group (Table 2). For 
example, using a CXCL 1 0-stimulated threshold of 8500 pg/ ml for 
a positive result in addition to those who had TB Ag - Nil > 
0.35 lU/ml to define a positive test gave sensitivity for detecting 
TSTc of 61.9%, sensitivity for TST+ of 86.0%, and specificity for 
TST- of 78.7%. With the addition of CXCL 1 0-stimulated > 
8500 pg/ml to define a positive test, an additional 14.0% of the 
TSTc tested positive, but at the cost of an additional 9.0% of TST- 
HHC testing positive. Using a CXCLIO strmulated/unstimulated 
ratio cutoff of 8 had largely similar trade-oflFs. Trade-offs achieved 
by using CXCLIO to improve sensitivity could be similarly 
achieved by decreasing the threshold of TB Ag - Nil for a positive 
test to 0.10 lU/ml (Table 2). 

Differences in IGRA and TST concordance by BCG status 

Of the 357 household contacts, BCG scar was present for 276, 
absent for 68, and missing for 13; the latter were excluded. In the 
BCG-negative subgroup, a higher percentage was estimated to be 
positive for TST and IGRA compared to the BCG-positive but 
these differences were not statistically significant. While we found 
similar Kappa statistics comparing final TST and IGRA results by 
BCG status (Kappa = 0.54 and 0.56 for BCG- and BCG+ 
subgroups, respectively), the percentage of the TSTc with positive 
IGRA was higher in BCG- subjects than among the BCG+ (7 1 .4% 



vs 41.0%), respectively); however, this difference was not statisti- 
cally significant (p = 0.22). The percentage IGRA-negative among 
TST- was similar between BCG- and BCG+ (83.3% vs 87.9%, 
respectively, p — 0.62) (Table SI). When we compared the TB Ag 
- Nil ROC curves and used the area-under-the-curve (AUC) to 
quantify the trade-off in sensitivity and specificity using TSTc and 
TST- contacts as the gold standard between BCG- and BCG+ 
groups, we found a significant difference in the AUC (p = 0.007) 
(Figure S2), indicating that TB Ag - Nil better separates TSTc 
from TST- among BCG- compared to BCG+. Without a gold 
standard for Mtbinfection it is not clear whether the difference in 
AUC is due to superior identification of infection in the BCG- 
subgroup by TST, IGRA, or some combination of the two. 

Discussion 

Once IGRA became commercially available, numerous studies 
were conducted to determine the performance of the test, 
including sensitivity, specificity and accuracy. A systematic review 
of the literature and meta-analysis of the data therein found that 
commercial IGRAs had a higher positive and negative predictive 
value for progression from latent infection to active disease [31]. 
Findings from another meta-analysis review also showed that the 
IGRA test had high specificity and was not influenced by BCG 
vaccination [32]. In Korea, a country of intermediate TB 
incidence, IGRA correlated significantly better with increased risk 
of infection among contacts compared with the TST using both 
10 mm and 15 mm cutoffs [33]. However, discordant TST and 
IGRA results [20,21,34], concerns regarding the test performance 
characteristics of IGRA [35—39] and inconsistency in IGRA test 
results [16] have been reported. In the present study we evaluated 
the performance of a commercial IGRA (QuantiFERON TB 
Gold IN-TUBE) in comparison to TST as part of a HHC study 
that we are conducting in Vitoria, Brazil. We found 21% 
discordance between the two tests and surprisingly a greater 
number of HHC were TST positive and IGRA negative 
compared to those who were TST negative and IGRA positive. 

BCG vaccination when given after the first year of age results in 
larger and persistent TST reactions while vaccination in infancy 
has minimal effect on TST, when the TST is performed 10 years 
or more after BCG vaccination [11]. Since the Brazilian 
population is BCG vaccinated at birth and the median age of 
contacts was 20 years, BCG cross-reactivity is unlikely to explain 
the substantial discordance observed in the TST positive HHC 
who are IGRA negative. We did not perform separate analysis for 
study participants £5 years of age since only 33 children, 
including 13 with positive 1" TST and 3 TST converters, were 
£5 years. Therefore in our study we were not able to assess 
whether BCG status was a contributing factor among younger 
HHC. However, a study that reported only moderate agreement 
between IGRA and TST in children with LTBI, found that the 
discordance was not due to BCG vaccination induced false- 
positive TST test [34]. The observations that IGRA negativity is 
present in the TST converter group and that percentage of HHC 
with BCG scars is similar in the discordant and concordant 
subgroups further support that the TST and IGRA discordancy is 
unlikely to be due to cross-reactivity from BCG vaccination. On 
the other hand, when only defining TST converters as true 
positives, TST and IGRA concordance was higher in the BCG- 
group as summarized by the ROC AUC, but these results are still 
difficult to interpret in the absence of a true gold standard for TB 
infection. Other studies have also reported that BCG does not 
influence TST and IGRA test discordance in non-pediatric 
populations [40,41]. 
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Table 2. Sensitivity and specificity using alternate cutoff values. 



Definition of Test+ 


Statistic 


Estimate 




TB Ag-NilaO.35 lU/ml 


Pr(T+| Prevalent TST-I-) 


81.5% 


(74.1%, 


87.2%) 




Pr(T+|TST Converter) 


47.9% 


(34.6%, 


61.5%) 




Pr(T+|TST Converter excluding differences<6 mm) 


57.9% 


(40.9%, 


73.2%) 




Pr(T-|TST-) 


87.7% 


(93.3%, 


78.6%) 


TB Ag-Nils0.15 lU/ml 


Pr(T+| Prevalent TST-I-) 


86.7% 


(80.2%, 


91.2%) 




Pr(T+| TST Converter) 


56.3% 


(41 .4%, 


70.0%) 




Pr(T+|TST Converter excluding differences<6 mm) 


60.5% 


(43.1%, 


75.6%) 




Pr(T-|TST-) 


83.3% 


(90.1%, 


73.2%) 


TB Ag-NllaO.lO lU/ml 


Pr(T+| Prevalent TST-I-) 


89.7% 


(84.1%, 


93.6%) 




Pr(T+| TST Converter) 


62.5% 


(47.3%, 


75.6%) 




Pr(T+|TST Converter excluding differences<6 mm) 


63.2% 


(44.6%, 


78.5%) 




Pr(T-|TST-) 


81.6% 


(88.8%, 


71.2%) 


TB Ag-NilaO.35 lU/ml, CXCLIO Stimulated/Unstimulated Ratioa? 


Pr(T+| Prevalent TST-I-) 


85.3% 


(74.7%, 


91.9%) 




Pr(T+| TST Converter) 


61.9% 


(39.3%, 


80.3%) 




Pra-|TST-) 


78.7% 


(88.9%, 


63.2%) 


TB Ag-NilaO.35 lU/mi, CXCLIO Stimulated/Unstimulated >8 


Pr(T+| Prevalent TST-I-) 


84.6% 


(73.9%, 


91.4%) 




Pr(T+| TST Converter) 


61.9% 


(39.3%, 


80.3%) 




Pr(T-|TST-) 


78.7% 


(88.9%, 


63.2%) 


TB Ag-NilsO.35 lU/ml, CXCLIO Stimulated/Unstimulated alO 


Pr(T+| Prevalent TST-I-) 


83.1% 


(72.5%, 


90.2%) 




Pr(T+| TST Converter) 


55.0% 


(30.2%, 


77.5%) 




Pr(T-|TST-) 


83.2% 


(91.8%, 


68.7%) 


TB Ag-NilaO.35 lU/ml, CXCLlO-stimulatedsS.SOO pg/ml 


Pr(T+|Prevalent TST-I-) 


86.0% 


,76.6%, 


92.1%) 




Pr(T+| TST Converter) 


61.9% 


(36.1%, 


82.4%) 




Pr(T-|TST-) 


78.7% 


(87.8%, 


65.6%) 


TB Ag-NilaO.35 lU/ml, CXCLlO-stimulated >1 1,000 pg/ml 


Pr{T+| Prevalent TST-I-) 


86.0% 


(76.6%, 


92.1%) 




Pr(T+| TST Converter) 


55.0% 


(31.1%, 


76.8%) 




Pr(T-|TST-) 


83.2% 


(91.1%, 


70.6%) 


TB Ag-NilaO.35 lU/ml, CXCLlO-stimulated >12,000 pg/ml 


Pr(T+| Prevalent TST-I-) 


86.0% 


(76.6%, 


92.1%) 




Pr(T+| TST Converter) 


51.5% 


(28.1%, 


74.3%) 




Pr(T-|TST-) 


85.5% 


(93.1%, 


72.0%) 



Identification of prevalent TST+, TST converters, and TST- using I) TB Ag — Nil cutoffs, ii) combinations of TB Ag-Nil and CXCL1 0 Stimulated levels or CXCL1 0 Stimulated/ 
Unstimulated ratio and iii) Menzies definition (excludes those with difference in TST<6 mm), and TST- using TB Ag — Nil cutoff OF 0.25 lU/ml. Participants come from 74 
(IGRA testing)/46 (IGRA and CXCLIO testing) households. Estimation uses a GEE approach with an independent working correlation matrix to adjust for clustering in 
households. IVlodels based on the sample that included CXCLIO cutoffs were weighted to reflect the larger study cohort. 
Pr(T-F|Prevalent TST-t-) is the percentage test-positive among those who are were TST-t- at week 1-2. 

Pr(T-F|TST Converter) is the percentage test-positive among those who are TST-converters (TST- at week 1-2 and TST-f at week 8-12). 

Pr(T-F|TST IVlenzies Converter) is the percentage test-positive among those who are TST-converters (TST- at week 1-2 and TST-F at week 8-12 and change of at least 
6 mm). 

Pr(T-| TST-) is the percentage test-negative among those who are TST-negative at weeks 1-2 and 8-12. 
doi:1 0.1 371 /journal.pone.0096564.t002 



In a study by Aiiibarro and colleagues [42] the concordance 
between TST and IGRA tests in HHC, as well as correlation of 
both tests with the degree of TB exposure, both at baseline and at 
2 months were evaluated. The authors reported that concordance 
between TST and IGRA, and correlation with intensity of Mtb 
exposure, both were improved at the 2-month time period. 
Consistent with this report, we also observed that HHC that were 
TST positive but IGRA negative had less exposure to the index 
case compared to the concordant group. These data suggest that 
following Mtb exposure, IGRA conversion may take longer than 
TST. Clearly, negative IGRA tests should be interpreted with 
caution, particularly in a high risk setting given that TST performs 
rather well in high Mtb burden settings [43] . 



A study aimed at assessing the discordance between IGRA, T- 
SPOT. TB, and TST for the diagnosis of LTBI in a low prevalence 
population found that the three tests identified difiFerent people as 
test positive, suggesting that most positives from IGRA, T- 
SPOT.TiB, and TST are false-positives in low-prevalence popu- 
lations [44]. False positive IGRA test also pose a significant 
challenge with detection of conversion in health care workers 
(HCW') from low-risk settings [45]. As demonstrated in this 
retrospective study, 64.8% of the HCW that had converted IGRA 
showed reversion to negative on a repeat IGRA test. However, if 
the cutoff values were increased to 5.3 lU/ml from the 
manufacturer's cutoff value of &0.35 lU/ml, then the conversion 
rates were very similar to the historical conversion rate seen with 
TST in HCW in the institute where the study was performed [45] . 
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Serial IGRA testing in HCW in Cape Town South Africa showed 
significant intra-individual variability in the magnitude of the IFN- 
Y responses [46] . Another study conducted in German HCW also 
reported within-subject variability of serial IGRA tests, particularly 
those with values around the cutoff [47] . Significant variabilit)' was 
also observed in IGRA test results upon retesting of the same 
samples conducted in a low TB incidence setting [16]. A 
conclusion that was derived from the systematic review of all 
IGRA studies performed in HCW was that IGRAs are not 
superior to TST in identifying the incidence of new Mtb infection 
[48]. These studies show the importance of interpreting both TST 
and IGRA results in the context of exposure and prevalence. Serial 
studies in larger cohorts are necessary to determine whether IGRA 
reversion in an individual is an indication of Mtb infection that is 
cleared or whether it is truly false positive. In this regard, there is 
growing consensus that the current IGRA cutoff value requires 
reevaluation and borderline results need to be interpreted with 
prudence [16,49]. 

Quantitation of additional analytes in IGRA supernatants has 
been to shown to distinguish between active disease and Mtb 
infection in children [50]. Inclusion of CXCLIO improved the 
performance of the IGRA test and helped to discriminate infected 
from non-infected subjects [51]. In response to Mtb-specific 
antigens, high-risk children expressed significandy higher levels of 
CXCLIO and IL-2 than low-risk groups and the levels strongly 
correlated with Mtb exposure and grade of infectiousness of index 
cases. The study also found an excellent agreement between 
CXCLIO, IL-2 and IGRA [28]. In the present study, we found 
that by either lowering the IGRA cutoff value or by including the 
analysis of CXCLIO in the IGRA supernatants, sensitivity was 
enhanced but at the cost of compromising the specificity of the 
IGRA test and that we could obtain similar trade-offs by either 
strategy. Additional studies are required to determine whether 
inclusion of the evaluation of other cytokines/chemokines will 
improve IGRA's sensitivity. 

Variability in repeat test results and poor specificity in certain 
populations where sensitivity to non-tub(;r{ ulous mycobacteria is 
widespread [52] prompted modification of the initial 10 mm cutoff 
value for TST. Interpretation of the TST results now takes into 
account the chnical setting, with the current recommendation of 
15 mm cutoff for low prevalence population and a 5 mm cutoff for 
high-risk population [53]. Similarly, there is growing consensus 
that IGRA cutoff of 0.35 lU/ ml for distinguishing positive and 
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